Designing Correlation Indices with Bucketing and Composition

نویسندگان

  • Hideaki Kimura
  • Stanley B. Zdonik
چکیده

In relational query processing, there are generally two choices for access paths when performing a predicate lookup for which no clustered index is available. One option is to use an unclustered index. Another is to perform a complete sequential scan of the table. Online analytical processing (OLAP) workloads often do not benefit from the availability of unclustered indices; the cost of random disk I/O becomes prohibitive for all but the most selective queries. Unfortunately, this means that data warehouses and other OLAP systems will frequently perform sequential scans, unless they can satisfy nearly all of the queries posed to them by a single clustered index [6], or unless specialized data structures – like bitmap indices, materialized views, or cubes – can be used to answer queries directly. We present a new index data structure called a correlation index (CI) that enables OLAP databases to answer a wider range of queries from a single clustered index or sorted file. The CI exploits correlations between the key attribute of a clustered index and other unclustered attributes in the table. We show that CIs can be implemented as an “add-on” to an existing database via a simple query rewriting scheme. In order to predict when CIs will exhibit wins over alternative access methods, we develop an analytical cost model that is suitable for integration with existing query optimizers. We also develop algorithms that search for strong candidate CIs and that recommend ways to “bucket” CIs to reduce their space utilization. We compare CI performance against sequential scans and unclustered B+Tree indices in PostgreSQL. Our results on several different data sets validate the accuracy of our cost model and establish numerous cases where CIs accelerate lookup times dramatically over other access methods. We also show that standard B+Trees can benefit from correlations just as CIs do, and that CIs are typically much smaller than B+Trees (by up to three orders of magnitude), making it possible to maintain many of them in memory over a single table.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی ارتباط مؤلفه‌های ترکیب بدنی با تعادل ایستا، پویا و سا‌بقه زمین‌خوردن در افراد سا‌لمند فعال

Objectives: The purpose of this study was to investigate relationship between body composition indices with static and dynamic balance and rate of falling in active elderly people. Methods & Materials: This research was a correlation study. Active elderly women volunteered for participation in this research (n=45). Body composition indices (body fat mass, fat free mass, body mass index, wais...

متن کامل

برآورد غیر تهاجمی حداکثر اکسیژن مصرفی (Vo2max) در کارکنان دانشگاه علوم پزشکی مازندران

Background and purpose: Maximum Oxygen Consumption (Vo2max) is a measurement for assessing cardiorespiratory fitness. Its direct measurement is an aggressive method that is technically and operationally difficult. This study investigated the correlation between this variable and the variables obtained from non-invasive methods of evaluation of body composition indices, step count, and physical ...

متن کامل

Designing a specific upper body anaerobic power test for wrestling

Abstract The aim of this study was designing a specific upper body anaerobic power test for wrestlers and determining validity, reliability and objectivity of the designed test. In order to assess the anaerobic power of wrestlers on the basis of upper body Wingate test, Twenty two wrestlers (age=23/40±3/20 year, height=173/13±6/97 cm, weight=74/55±3/88 kg) of Tehran wrestling team (most of the...

متن کامل

Correlation of dietary protein intake with body composition and physical status in patients with knee osteoarthritis

Background and Objectives: Little is known about the association between dietary protein intake and clinical manifestations in osteoarthritis (OA) patients. We aimed to determine the correlation between dietary protein intake and pain severity, functional status, and body composition indices in patients with knee OA. Materials and Methods: This cross-sectional study was performed on 220 OA pat...

متن کامل

Designing the Questionnaire of Teachers’ Work Life Quality

Background and Objectives: Quality of work life is one of the most important factors in promotion of teachers and having them continue their jobs. This study aimed at designing and evaluating a questionnaire for teachers’ work life quality.  Methods: In this research, a sequential exploratory approach (instrument editing model) was used and in the qualitative stage, a semi-structured interview...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008